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Abstract. We present an analytical framework for gossip protocols based 
on the pairwise information exchange between interacting nodes. This 
framework allows for studying the impact of protocol parameters on the 
performance of the protocol. Previously, gossip-based information dis- 
semination protocols have been analyzed under the assumption of per- 
fect, lossless communication channels. We extend our framework for the 
analysis of networks with lossy channels. We show how the presence 
of message loss, coupled with specific topology configurations, impacts 
the expected behavior of the protocol. We validate the obtained models 
against simulations for two protocols. 



1 Introduction 

Gossip protocols have emerged as an attractive solution for distributing informa- 
tion in large-scale systems, due to their simplicity and efficiency. Randomness 
and the distributed nature of gossip protocols considerably increase their ro- 
bustness compared to deterministic protocols, also in the presence of failures 
and data loss. 

Although gossip protocols are characterized by their superficial simplicity, 
the large-scale behavior of a gossip system is not easily predictable. From the 
perspective of protocol design and behavior prediction, this is an undesirable 
situation, and with the growing number of gossip protocols, there is an increasing 
demand for their analysis in an insightful and systematic way. 

However, the formal analysis of these protocols is still a rather unexplored 
research field with many challenges, in part caused by the fact that traditional 
techniques quickly lead to a state-space explosion (see, e.g., [14, 10, 2]). In [6], we 
have proposed a modelling framework for gossip-based information dissemination 
protocol, in which each node periodically selects a random peer and shuffles its 
local data. In this paper, we show that this approach is generally applicable 
to gossip protocols for information dissemination, and we extend the analysis 
method to networks with lossy channels. An exchange of data between nodes is 
modelled as a state transition capturing the presence or absence of a selected 
data item before and after a gossip between two nodes. 



Our modelling framework allows for the prediction of the system behavior in 
large-scale environments. We propose a model based on local pairwise interac- 
tions which in combination with model checking (cf. [5]) or with the mean- field 
framework (cf. [3]) allows for the analysis of the emergent behaviour of the sys- 
tem. The model can also be used for optimization of system parameters and 
performance (cf. [6]), and for fast event-based simulation, as it is done in this 
paper. 

We introduce our theoretical framework initially assuming no communica- 
tion failures in the network. Later, we show that the framework is applicable 
to networks where nodes communicate over an unreliable medium. An impor- 
tant part of our proposed framework is the probability Pdrop that an observed 
item in a node's local storage has been replaced by another item after a gossip. 
We derive the expressions for this probability under the assumption of perfect 
communication medium for two protocols, Newscast [17] and Shuffle [15]. In 
the presence of message loss, however, the dependence of the probability on the 
type of underlying network graph emerges. We deconstruct the expression for 
Pdrop into its main components, and identify the ones that depend on specific 
scenario configurations, i.e., message loss and topology combinations. One such 
component is the probability of an observed item found at one gossiping node, 
to be also found at its peer. Since it is not feasible to cover all possible network 
topologies in one formula, the computation of this probability is based on sta- 
tistical data that can be obtained from Monte Carlo simulations of the protocol 
at hand. 

Since the work of Demers et al. [11], gossip protocols often follow bidirec- 
tional data exchange (push-pull) for better performance. As observed in [13], 
the performance of these protocols is usually evaluated under the assumption 
of a perfect, error-free communication medium. Thus, data exchange operations 
between nodes are generally considered to be atomic operations, e.g., [21, 7, 22, 
18,24,15,12,6]. That is, if a node initiates a gossip with another node, both 
nodes base their local decisions upon each other's data, and in some protocols 
even guarantee the preservation of each other's data. In practice, however, im- 
plementing data exchange as an atomic operation is hard to achieve assuming 
communication over unreliable media [23]. 

In this respect, our paper makes an additional contribution; it investigates 
the impact of a realistic environment with lossy communication channels on 
a push-pull-based gossip protocol. Furthermore, we present in more detail the 
following observations: 

• Introducing message loss in the network model affects the emergent behavior 
of the protocol in a specific manner: with the introduction of lossy communi- 
cation channels, the correlation between local storage content of neighboring 
nodes increases, and the degree of the correlation depends on the network 
topology. 

• The fewer neighbors nodes have in the underlying network, the stronger the 
effect of message loss on the emergent behavior of the protocol. For fully 
connected networks, message loss does not have an impact on the distribution 
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of data over the network, which remains uniform (as observed when there are 
no losses). 

Two areas of research are relevant to our work: generic modelling frameworks 
for gossip protocols and the performance of gossip protocols in the presence of 
message loss. Automated mean-field framework for dynamic gossip networks has 
been presented in [4]. Due to the imdcrlying mean- field method, used by this 
framework, the results arc accurate only for very large networks and for average 
behavior of the modelled system. The mean-field framework can be combined 
with our framework, as it is shown in [3] for Shuffle. 

There is previous work [16] on a simple gossip-based membership protocol 
with nonatomic protocol actions in the presence of message loss of up to 1%. 
The authors proposed to use a push-based gossip protocol, in which only an 
active node sends data to its peer, and immediately removes the sent data from 
its cache. Moreover, the node sends only two items, IP addresses and ports of 
itself and of a random peer from its local cache. In the protocols that we study, 
each gossiping pair exchanges a random subset of the data items. The authors 
show the protocol correctness modelling it by graph transformations, similar 
to the approach in [10]. The protocol properties analyzed include the expected 
number of neighbors a node has and the uniformity of the neighborhood lists. 
The theoretical results in [16] were not compared with experimental evaluations 
of the gossip protocol. The scope of their paper differs from ours: it focuses on the 
correctness analysis, leaving out performance analysis of the proposed protocol. 

The paper is organized in the following sections. Sec. 2 outlines the class of 
gossip protocols that can be analyzed with our framework, presented in Sec. 3. 
We consider two case studies. Newscast and Shuffle, and demonstrate the mod- 
elling and derivation of the state transition matrix for them, in Sec. 4 and Sec. 5, 
respectively. Sec. 4 demonstrates the modelling of push and pull versions of News- 
cast. In Sec. 6, we model and analyze Shuffle in the presence of message loss. 
Sec. 7 stretches the scope of our theoretical framework to gossip-based member- 
ship protocols like Cyclon. Sec. 8 concludes the paper. 

2 Background 

In this section, we specify a class of gossip protocols for which our modelling 
framework is applicable. We consider large networks, where nodes interact in 
a pecr-to-peer style. All nodes have a common agreement on the frequency of 
gossiping. Each node stores local data in its cache and executes two different 
threads, an active and a passive one (see Fig. 1). The active thread periodically 
initiates a contact with a randomly chosen peer p by sending it a (sub) set a of 
its cache, and waits for a reply. Upon reception of the reply ap, the node updates 
its cache based on the cache, a and ap. 

The passive thread waits for a message sent by the active thread of a neighbor 
and replies to it with its data a. The node then updates its cache based on the 
stored, received and sent data. The exchange message with its content is called 
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wait (At time units) 
p <— RandomPeer(); 
a PrepareMsgO; 
send a top; 



wait until receive((7p) 
a PrepareMsgO; 
send a to sender( cTp); 
a Update((7, cTp); 



wait until receive((Tp) 
cr <— Update((T, cTp); 



(a) active thread 



(b) passive thread 



Fig. 1. Skeleton of a gossip protocol 



an exchange buffer. Nodes that execute their active thread are initiators, and 
nodes that execute their passive thread are called contacted. 

The generic protocol, described above, is a push-pull gossip protocol. Other 
variations of gossip protocols are ones in which the initiator only sends local 
data to its gossip partner (push), and ones in which an initiator only requests 
data from its peer (pull). All three versions of gossip protocols are covered by 
our framework. 

The random peer selection Random Peer() is based on the set of neighbors as 
determined by the underlying network graph. The nature of the data and the 
result of the generic operations in Fig. 1 are application-dependent. In gossip- 
based information dissemination protocols like [18, 15], a finite list of news items 
composes the local cache of a node. The operation a PrepareMsgO in Fig. 1 
selects a random (or predefined) set of items from the cache of the node. The 
method Update merges the list of old items with the list of received items. Prop- 
erties of the protocols that can be analyzed within our framework include the 
number of copies of an item in the network over time and the speed at which 
items spread throughout the network. 

In gossip-based membership management protocols, such as [24, 1] , a cache 
of each node consists of a finite set of its peer IP addresses, so-called the partial 
view of the membership of the network. The method Update produces a sample 
of the union of the old and the received views. The performance metrics of these 
protocols include a distribution of the partial view size, and the number of nodes 
reached in the presence of node failures. We refer to [9] for other applications of 
gossip protocols. 

As we demonstrate later in this paper, our framework models gossip-based 
information dissemination protocols, but it can be also applied to other type of 
applications, such as gossip-based membership protocols. 

3 Pairwise Interaction Model 

An exchange of data items between nodes is modelled as a state transition, 
capturing the presence or absence of an observed data item d before and after 
a gossip interaction between two nodes: an initiator A and a contacted node B. 
Each state is then a pair (a, b) of bits, each indicating the presence (if equal to 1) 
or the absence (if equal to 0) of the item in the cache of A and B, respectively. 
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P(00|00) P(01|01) 




P(10|10) P(01|01) 



Fig. 2. Pairwise interaction model 

The model has four possible states of the caches of A and B: (1) when both 
hold d, (2-3) either of the caches holds the item d, and (4) neither cache holds 
d. These correspond to the states (0,0), (0,1), (1,0), and (1,1) in the state 
transition diagram, shown as Fig. 2. Transitions from one state to another are 
labelled by the respective transition probabilities P(a2&2|ai^i), where aibi is 
the state before a gossip interaction, and 02^2 is the state after the interaction, 
with Qi^bi e {0,1}. ai, 02 and 61, &2 correspond to states of nodes A and B. 
For instance, P(01|10) means that node A had d before the interaction, which 
it passed on to B, afterwards. 

The building blocks of all transition probabilities -P(a2&2|ai&i) of the state 
diagram are two probabilities: (i) the probability Pseiect of an item to be selected 
by a node from its cache for a gossip, and (ii) the probability Pdrop that an item 
is replaced by another one, received by its node in the gossip. The expressions for 
Pseiect and Pdrop depend on specifics of the operations PrepareMsg and Update, 
respectively. Moreover, these probabilities are functions of the number of items 
n, exchange buffer size s, and cache size c, and < s < c. We assume that c < n, 
i.e. all n items cannot be stored in a single local cache. We explain our framework 
by considering state transitions and computing the corresponding probabilities 
for several case studies. 

Our state diagram differs from a Markov chain of the system: it expresses a 
state of gossiping pair rather than state of all nodes. The transition probabili- 
ties are used with other frameworks, e.g., mean-field framework and numerical 
simulations, to study the emergent behaviour of system. 

4 Case study: Newscast 

We now briefiy describe our first case study, a simple push-pull information 
propagation protocol. It is a variation of the Newscast protocol [17]. In the 
original version of Newscast, each item is paired with a timestamp indicating 
when it was created. The original Newscast protocol can also be modelled using 
our analytical framework; we will come back to the details in Sec. 7. 
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The basic idea of the protocol is a periodic exchange of data items published 
by nodes in the network. Items can be, for example, a number or network address 
(IP and port) of a node. For now, we assume that nodes do not crash and 
communication channels are failure-free. 



4.1 Specification 

The protocol operates in a wide area network^. A node periodically picks a 
random peer and exchanges s random items with it. The node then selects c 
random items for its new cache among the received items and those in its cache. 
The methods in Fig. 1 for Newscast are summarised in the following table: 



method 


operation 


RandomPeer() 


select a peer uniformly at random 


PrepareMsgO 


select s random items 


Update(<7, cTp) 


store only c random items, 

selected among the items in its cache 

and the received items cTj, 



4.2 Transition diagram 

Wc analyse the spread of a generic item, which we call d, using the framework 
introduced in Sec. 3. As stated previously, there are four possible states of the 
caches of A and B. 

The state (0, 0) has only one outgoing transition, back to itself. P(00|00) = 1 
since if A and B do not have d before their exchange then they clearly still do 
not have d after the exchange. We now determine values for other probabilities 
P(a2b2\aibi). Note that due to the symmetry of information exchange between 
gossiping nodes in the protocol, P{a2b2\aibi) ~ P{b20.2\biai)] and the probabili- 
ties Pdrop and Pseiect are the same for both initiator and contacted node. We use 
the following notations for complementary probabilities: P-^seiect for 1 — Pseiect, 
and P^drop for 1 - Pdrop- 

State (0, 1) Before gossip, d is only in the cache of node B. 
02^2 = 01: B neither sends nor drops d from its cache, or B sends and keeps d, 
and A drops it, i.e., the probability is P(01|01) = P^select ■ P-^drop + Pseiect ■ 

P—' drop ' Pdrop • 

02^2 = 10: B selects and drops d, and A keeps it; i.e., the probability is P(10|01) = 

Pseiect ' Pdrop ' P-^drop- 

a2&2 = 11: B sends d, and both nodes keep it; i.e., P(ll|01) = Pseiect • P^drop ■ 

P—>drop' 

a2&2 = 00: B sends d and both nodes drop it, or B does not send and drops d; 

-P(00|01) — Pseiect ' Pdrop ' Pdrop ^ P-'Select ' Pdrop- 

State (1, 1) Before gossip, d is in the caches of both nodes. 



^ In the network of gossiping nodes, an information dissemination is fully dictated by 
gossiping frequency instead of communication latencies (cf. [17]). 
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02^2 = 01: node A drops d while updating its cache, but B keeps it; i.e., P(01|ll) = 

P^drop ' Pdrop 

0262 — 10: this transition is symmetric to the previous one. 

0262 ~ 11: neither A nor B drops the item d; i.e., P(ll|ll) = {P-,dropY- 

02^2 = 00: both A and B drop d; i.e., P(00|11) = {Pdropf- 

All transition probabilities are summarized in Fig. 3 



P(01|01) 


= P(10 10) 


— [P^seleci + ^select ' Pdrop) ' P^drop 


P(10|01) 


= P(01 10) 


— Pselect ' Pdrop ' P—'drop 


P(ll|01) 


= P(ll 10) 


— Pselect ' P^drop ' P^drop 


P(00|01) 


= P(00 10) 


— ^Pselect ' Pdrop ~t~ P^select^ ' Pdrop 


P(01|ll) 


= P(10 11) 


— P^ drop ' Pdrop 


P(lllll) 


— {P^drop)^ 


P(00|11) = (Pdropf P(00|00) = 1 



Fig. 3. Transition probabilities 



4.3 Building blocks: Pselect and Pdrop 

We now derive the expressions for the probabilities Pselect and Pdrop ■ The follow- 
ing analysis assumes that all node caches are full; that is, the network is already 
running for a while. 

Consider nodes A and B engaged in an exchange, and let B receive the 
exchange buffer Sa from A. Let k be the number of duplicates (see Fig. 4), i.e., 
the items of an intersection of the node cache Cb and the exchange buffer of its 
gossip partner Sa (i.e., SaC^Cb)- 



k = SaHCb 


s = SbCiSa 


(a) 


(b) 



Fig. 4. a) Items sent by A that are in the cache of B; b) Items in common for the 
exchange buffers of both A and B. 

The probability of selecting an item d in the cache is the number of selected 
items s divided by the total number of items in the cache c: Pselect = f ■ Thus, 
the probability that an item d in the cache is not selected is: P-,seiect — 

Among the c items in Cb, there are k items also in Sa', thus, only c random 
items of the total c+s — k items in Cb^{Sa\Cb) can be kept: P-.drop{k) = ^^qifn^- 
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Thus, the probabihty Pdrop that an item in Cb U {Sa\Cb) is dropped from Cb, 
given k items in Sa H Cb is as follows: 



P.™p(/c) = l-^^^. (1) 

Assuming uniform sampling of items, the average value of fc is ,s • ^. Thus, the 
probability of dropping an item after the exchange Pdrop is i+cn/{s(n-c)) ■ Later 
we discuss a case when the assumption of uniform sampling does not hold. 



4.4 Validation 

To validate our theoretical results, wc simulated Newscast in a round-based fash- 
ion similar to simulations in PcerSim [19]. A new item d is initially introduced 
in a network of 2500 nodes at one node. In this fully connected network, caches 
of all nodes are full and uniformly populated by n = 500 items. After each 
gossip round, we measure the total number of copies of d in the network (repli- 
cation property), and how many nodes in total have seen d over time {coverage 
property). 

Simulations with the simplified Newscast Each node in the network has a cache 
size of c = 100. Once in a round, every node selects uniformly at random one 
node from the network of 2500 nodes and exchanges s = 50 random items. To 
make a fair comparison with the simulations with the model, we let the nodes 
gossip for 1000 rounds items other than d; items are replicated and the replicas 
fill the caches of all nodes. At the round 1000, the observed item d is inserted 
into the network at a random location. From that round on, replication and 
coverage are measured by the end of each round. 

Sim,ulations with the model For the simulations with the model, the system 
parameters n, c and s are set to 500, 100 and 50, respectively. Each node in the 
network only maintains a state variable which indicates the presence or absence 
of d. Nodes update their state in pairs according to the transition probabilities 
in Fig. 3. While in the actual protocol, nodes update the contents of their caches, 
in the model simulations, nodes update only their state variables. Since we do 
not need a startup time for the simulations with the model, at the round 0, we 
set the state of a random node to 1, and all others to 0. From that round on, we 
track the states of the nodes. 

The graphs in Fig. 5 depict the performance of the simplified Newscast and 
of the analytical model in terms of replication (left) and coverage (right) of d. 
The results are collected from 1000 simulation runs of the protocol and 1000 
runs of the model. In approximately 72% of the all runs, the observed item 
disappears. In the graphs, however, we only considered the runs where the item 
did not disappear; e.g., to obtain 1000 runs of the protocol displayed in Fig. 5, 
wc performed a total of 3700 runs. Assuming a normal distribution of the results, 
the confidence interval for 95% confidence is 16 times narrower than the standard 
deviations drawn on the graphs. 
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Fig. 5. Clique: Replication (left) and coverage (right) of item d for Newscast and its 
model, with c = 100, n = 500, s = 50. 

Fig. 5 show the average and standard deviation of the successful runs with 
Newscast. The standard deviation bars on the graphs are the standard deviation 
of the rephcas and number of nodes that have seen item in the system. For all 
successful runs, the network reaches the equilibrium, in which there are around 
500 replicas of d. Due to random gossiping, item d is eventually seen by all nodes 
in the network, when coverage reaches all 2500 nodes. 

We have chosen this type of network graph for a basic introduction of our 
framework since a uniform distribution of items over the network is clearly a 
valid assumption in a fully connected ncitwork of nodes executing Ncnvscast. 
We will turn to other types of network graphs in the next case study. Our 
framework allows for modelling of push-only as well pull-only gossip protocols, 
see Appendix A. 



5 Case Study: Shuffle 

We now move on to another case, the ShufHe protocol introduced in [15] and 
originally analyzed in [6] using our framework. This section is intended to make 
the reader familiar with the analysis of this protocol. 



5.1 Specification 

Shuffle aims at the conservation of data in an ad hoc network. Each node main- 
tains a cache of data items that arc disseminated throughout the network. The 
dissemination is done by periodic exchange of s random items between two gos- 
siping peers. 

The procedure of selection of items is similar to the one in Newscast, but 
items are discarded according to a different policy: (i) a node cannot discard an 
item unless it is sent to the gossip partner; (ii) the partner is not allowed to drop 
received items. 

A node can keep only c items in its cache after the exchange. A peer will favor 
dropping items it has just sent over other items in its cache. Note that in this way. 
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an exchanged item will always be preserved, and possibly even replicated. Since 
the peer, in its turn, has agreed to store received items in its cache, discarding 
items does not lead to the loss of information in the network (if there is no 
message loss). The generic routines in Fig. 1 can be summarized for ShufHe as 
follows: 



method 


operation 


RandomPeer() 


select a peer uniformly at random 


PrepareMsgO 


select s random items 


Update(<7, cTp) 


• add CTp received entries to the cache; 

• remove duplicated items; 

• remove items among a \ Up uniformly 
at random until the cache has c items. 



5.2 Treinsition Diagram 

Unlike in Newscast, nodes are not allowed to discard the items received from their 
gossip partners. Taking this difference into account, the transition probabilities 
for ShufHe are quite easy to derive from the transitions in Fig. 3. Note that for 
this protocol the state (0, 0) has only a self-transition, and no other outgoing or 
incoming transitions, because of the preservation nature of the protocol. That 
is, if either nodes send an item, its partner keeps this copy as well, and if an 
item is not among the selected for a shuffle, the item is not replaced by another 
one. The transition probabilities can be easily computed, sec Fig. 6. For more 
details on derivation of these transition probabilities, we refer to [6]. 



P(01|01) = P(10|10) = P^select P(10|01) = P(01|10) = Pselect ' Pdrop 
P(01|ll) = P(10|ll) = P,e led ' P— 'Select ' Pdrop 

P(00|00) = 1 

P(ll|10) = P(ll|01) = prelect ■ P^drop 

P(ll|ll) = 1 — 2 ■ Pselect ■ P^select ' Pdrop 

Fig. 6. Transition probabilities for Shuffle 



Pselect remains the same as for Newscast, Pselect = ^- However, Pdrop is 
different for Shuffle, and we derive it in the next section. 

5.3 Probability of Dropping an item 

Pdrop represents the probability that an item that can be overwritten is indeed 
overwritten by an item received by its node in the exchange. 

Pdrop depends on the number of items both gossiping nodes have in common, 
in particular, how many of such items the contacted node receives during the 
exchange. To derive the expression for Pdrop, we assume a uniform distribution 
of items over the network; in the absence of message loss, this assumption is 
supported by experiments in [15,20,6] and analysis in [8]. 
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Consider again Fig. 4. By assumption, Ca and Cb are a uniform selection 
from the entire population of n items. Sa and Sb are chosen uniformly at random 
from Ca and Cb , respectively. After the exchange, node B has to allocate items 
from Sa in its cache Cb, taking into the account the duplicated items in Sa^Cb 
and Sa^Sb- Basically, the items in Sb that are not in Sa are replaced by items 
in Sa that are not in C^. So, every item from Sa \ Sb that is in Cs means that 
one item from Sb \ Sa can be kept in Cb- In other words, the probability Pdrop 
that an item from Sb U Sa is dropped from Cb equals to the probability that 
an item from Sa \ Sb is not in Cb- By uniform distribution, the probability is 

Thus, Pdrop « — • 
n—s ' ^lop n—s 

This approximation of Pdrop for Shuffle has been successfully established 
through experiments [6], and used for modelling and optimization in [6,5]. 



6 Shuffling through Lossy Channels 

So far, the analysis has relied on the assumption that nodes interact in a perfect, 
lossless communication environment. We now drop this assumption, and consider 
ad hoc networks where nodes are continually communicating with each other over 
an unreliable medium. We now explore the impact of lossy channels on a gossip 
protocol with a push-pull information exchange. 



6.1 Assumptions 

Every message sent now has a fixed, positive probability to be lost due to a 
disturbance of the communication medium. As explained in the introduction, 
we no longer assume that the shufHe procedure is atomic. 

There are three general cases in pairwise communication with respect to 
message delivery, as depicted in Fig. 7: (a) node A initiates a gossip with B by 
sending a message, but the message is lost, (b) node A successfully initiates a 
gossip with B, but a message from B is lost on its way to A, and (c) a gossiping 
pair receives messages from each other. 



6.2 Treinsitions 



We take the analytical model of ShufHe described in Sec. 5 as starting point. In 

our current model, the state (0, 0) is no longer isolated, since there is a possibility 
to remove the only copy of d from the cache of the node B, in the scenario shown 
in Fig. 7(b). 

A B 



(a) 





Fig. 7. Scenarios of communication with lossy channels: (a) loss of the request message, 
(b) loss of the reply message, (c) gossip without loss. 
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We again express the transition probabilities P{a2b2\aibi) in terms of Pseiect, 
Pdrop- To analyze the protocol in the presence of message loss, we introduce into 
the formal model an additional input parameter, the probability Pioss that a 
message is not delivered to a node due to channel loss. We use P^ioss for \ — Pioss- 

State (0, 0) Before shuffle, neither A nor B have d in their cache. 

02^2 = 00: neither A nor B have item d after a shufHe, because neither of them 

had it in the caches before the shuffle: P(00|00) = 1. 
02^2 G {01, 10, 11}: cannot occur, because none of the nodes have item d. 

State (1, 0) Before shuffle, d is only in the cache of node A. 

a2&2 = 01: occurs only when both the request of A and the reply of B are suc- 
cessful and node A selects and drops d: P(01|10) = (P-^joss)^ ■ Pseiect ■ Pdrop- 

a2&2 = 10: B docs not have d. because: (a) A did not select rf, or (b) A selected 
d but the request message got lost on the way to B: P(10|10) = P^seiect + 

Ploss ' Pseiect' 

a2&2 = 11: both nodes A and B have a copy of d because: (a) either both nodes 
received the gossip messages and A selected d and kept it, or (b) A selected d 
and the reply message from B got lost; that is, P(ll|10) = {P^ioss)^ ' Pseiect ■ 

P^drop ^ P^loss ' Ploss ' Pseiect 

02^2 = 00: cannot occur, as A would only drop d if it received a reply, which 
implies that B would keep d. 

State (0, 1) Before shuffle, a copy of d is only in the cache of node B. 

02^2 = 01: only B has d because the message from A got lost or, B received the 

message, and: (a) it did not select d or (b) B selected d, kept it, and reply 

got lost; i.e. the probability is P(01|01) = P^loss ■ {P^select + Ploss ■ Pseiect ■ 
P—'drop^ ~l~ Ploss- 

02^2 = 10: both request and reply messages were successfully delivered and 
B selected and dropped d, which amounts to the probability P(10|01) = 

{P^loss ) ' Pseiect ' Pdrop - 

02 &2 = 11: both messages were delivered successfully, and B selected and kept 

d; that is, P(ll|01) = {P-.ioss) " Ps elect ' P-i drop - 
02^2 = 00: neither of the nodes have d, because B selected and dropped d, but 

A did not receive the reply message. P(00|01) = P-^ioss ■ Ploss ■ Pseiect ■ Pdrop- 

State (1,1) Before shuffle, d is in the caches of A and B. 

a2&2 = 01: only B has d since both messages were successfully received, A se- 
lected and dropped d while B did not select it: P(01|ll) = {P^iossf' ■ Pseiect ■ 

Pdrop ' P—'select- 

a2&2 = 10: only node A has d because node B selected d, dropped it, and node 

A did not select d: P(10|ll) = P^loss ■ P^select ■ Pseiect ■ Pdrop- 

02^2 = 11: after the shuffle both nodes have d, because: 

a) the message from A did not arrive at B, i.e. Pioss', or 

b) the message from A was successfully received by B, but the reply message 
got lost, and: 
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★ A did not select d, while B (i) did not select d as well, or (ii) selected and 

kept d. P^loss ■ P^selecl ' Ploss ' {P^s elect Pselect ' P^drop)'i and 

-k node A selected d: P^ioss ■ Pseiect ■ Pioss; or 
c) both nodes received each other messages, and: 

★ A did not select d, while B (i) also did not select it, or (ii) selected and 

kept it: (P^loss) • P^select • {P^select + Pseleet ' P^drop)', and 

★ (i) A selected and kept d, while B did not select d, or (ii) both nodes 

selected d. {P-,loss) ' Pselect ' {P-tselect ' P^drop "I" Pselect^ 
Hence, P(ll|ll) = 1 - (2 - Pioss ■ (3 - Ploss)) ■ Pselect ■ P'.select ' Pdrop- 

02^2 = 00: discarding of an item by both nodes cannot occur. 

While Ploss expresses the reliability of the communication channels, Pselect 
and Pdrop are derived based on the behavior of the protocol. Pselect is again the 
random selection of items from the cache; an item has ^ chance of being selected, 
regardless of message loss. The calculation of Pdrop is complex, and we discuss 
it in the remainder of this section. 

6.3 The Probability of Dropping an Item 

We now analyze how message loss affects one of the building blocks of our model: 
Pdrop- In Sec. 5.3, we have already found an expression for Pdrop in the case 
of Shuffle, relying on the assumption of a uniform distribution of items. Here, 
we revisit Pdrop and derive a general formula without making any assumptions 
about the distribution of items. Later on, we will explore how message loss affects 
the distribution of items, and will determine that it is the coupling of message 
loss and the topology of the network that affects Pdrop- Having isolated the 
component of Pdrop that is affected by the message loss/topology coupling, we 
will propose the use of statistical data to calculate that specific part of the Pdrop 
expression. 

When a node selects an item to be sent to a neighbor, there is a probability 
Pdrop that the item will be dropped from the node's cache after the gossip 
exchange. The selected item may be dropped only when there is a need to create 
space for an item received from a gossip partner. Therefore, the probability Pdrop 
depends on the relationship between a) the new items received from the gossip 
partner (for which the node needs space, the shaded area in Fig. 8(b), referred 
to as ^i), and b) the items that the node has selected to send to the gossip 
partner and is allowed to discard (the shaded area in Fig. 8(c), referred to as 
^2)- In order to find an expression for Pdrop, we need to calculate the probability 
of an item being in Ai and the probability of an item being in A2 , which we will 
denote as P{Ai) and P{A2), respectively. 

Given two nodes, A and B, that engage in a gossip exchange, we can represent 
their caches as sets Ca and Cb , and the sets of items they exchange with each 
other by Sa and Sb, respectively. The sets Ca and Cb may have common 
elements (items), see Fig. 8(a). We define Pinx as the probability of any item 
found in one of the caches to be also found in the other. Knowing Pinx, we can 
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formulate P{Ai) as Pselect ■ (1 - Pinx) and P{A2) as Pselect ■ (1 - Pselect ■ Pinx)- 

The probability Pdrop can then be expressed as: 

P(A2) 1 — Pselect ' -Pina; 

In order to calculate a value for Pdrop , it is necessary to have a value for the prob- 
ability Pinx- For the case of Ca and Cb being random samples of a population 
of n items, that is, under the assumption that items are uniformly distributed 
over the network we can analytically deduce that Pinx = ^- For networks with 
perfect communication channels, repeated execution of Shuffle results in a uni- 
form distribution of items. In the next section, we look at the effect that lossy 
channels have on the distribution of items. 



6.4 Uniform Distribution of Items: Does it Still Hold? 

As already mentioned, the calculation of Pdrop in the presence of message loss 
requires us to speculate about the contents of the gossip partner's cache. As- 
suming that the items in the caches of both gossip partners are random samples 
of the totality of items in the network, we can easily estimate Pinx as ^ and, 
therefore, have an analytical expression for Pdrop ~ fiEf ■ In this section, we ver- 
ify whether the uniform distribution of items, observed under no message loss, 
is still a valid assumption. 



The Importance of Randomness 

To understand the effect of message loss on abstract level, consider one node in 
the network, executing Shuffle. We examine the scenario from Fig. 7(b), where 
a reply message from a node to the initiating node is lost due to channel failure. 
When two nodes A and B gossip their local items to each other, the probability 
that the message from B to A is lost, is P^ioss ■ Pioss- If the message of B is not 
delivered to A, items cUSa will be common to cache Ca and cache Cb after the 
shuffle, since B, unaware of the failure, purges the sent items Sb, and A, which 
received no reply, keeps its Sa items (see Fig. 9). 

Over time neighboring nodes have a growing intersection of items in their 
cache. If a node has a small neighborhood, the random sample becomes more 



c = Ca n Cb 


Ai = Sa\ Cb 


A2 = Sb\ Sa 








(a) 


(b) 





Fig. 8. a) Items in common for the caches of both nodes A and B; b) Items sent by A 
that are not in the cache of B; c) Items sent by B that are not in the exchange buffer 
of A. 
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Fig. 9. Correlation of the caches increases due to message loss 



biased towards the collection of items left in the neighborhood. Limited by the 
access to the storage space of the neighborhood, a set of neighbors maintains only 
a subset of all items. In general, the stronger the correlation between the caches 
content of the neighbours, the slower the spread of observed item throughout 
the network. 

For the fully connected graph, since it is the entire collection of the items ev- 
ery node has access to, the uniform distribution assumption in general remains. 
A larger neighborhood reduces the probability of repeated communication be- 
tween two nodes, and due to the increased communication range, there is a faster 
exchange between distant areas of the network. 

Experimental Observations 

To support our claims that message loss affects the distribution of items, we 
conducted simulation experiments. Each simulation experiment has a startup 
period of 1000 rounds, during which N — 2500 nodes gossip n — 500 different 
items under no message loss. We use three different network topologies: a fully 
connected network, a square grid with each internal node having 4 neighbors, 
and a network where every node has 4 randomly chosen neighbors (outdegree 
4). By the end of the startup period, the items have been replicated, achieving 
uniform distribution. That is, every item has a probability ^ = = 0.2 of 
being present in a given node's cache. After the startup period has finished, the 
communication channel is set up to fail with a probability Pioss- We calculate, 
per round, the probabilities P(ll), P{10) and ^'(01) that an exchange involves 
both nodes having item x, only the initiator having item x, or only the gossip 
partner having item x, respectively. 

Fig. 10 shows the average values of ^'(ll), -P(IO) and ^'(Ol) over 1000 rounds 
for different values of Pioss using the three topologies mentioned earlier. As 
expected, for the case of no message loss {Pioss — 0) the probabilities suggest a 
uniform distribution of items in the network. We can analytically deduce their 
expected value, which matches the experimental results, with P{11) as - • - = 
0.04 and P(10) = P(01) as f • (1 - ^) = 0.16. However, as the probability of 
message loss increases, we observe some changes in P(ll), P(10) and P(01). 
For the case of the fully connected network, the probabilities remain stable, 
suggesting that the uniform distribution of items remains unaffected by message 
loss. With the other topologies, however, as message loss increases, the number 
of gossip interactions between nodes that both have item x also rises. Since 
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Fig. 10. Probability that a gossip exchange occurs between a) a node that does not 
have item x and a node that has the item, b) a node that has item x and one that does 
not, and c) two nodes have item x, for different topologies. 



item X is representative of all items in the network, the graphs suggest that, 
as a consequence of message loss, gossip partners have more items in common 
than they would have if items were uniformly distributed. In other words, due to 
message loss, a node is more likely to have items in common with a neighbor than 
with another random node in the network. Hence, the topology of the underlying 
network now plays a role in calculating Pdrop- 

6.5 Calculating Pdrop based on Statistical Data 

By now, we have established that in case of message loss the structure of neigh- 
borhoods plays an important role in determining the distribution of items, which, 
in turn, determines the probability of an item found in a node's cache to be also 
found in the gossip partner's cache, Pmx- Our calculation of Pdrop depends on 
finding a value for Pmx- This topology varying component can be modelled an- 
alytically or measured experimentally from a single run of the protocol, for a 
given topology. Here, we opt for obtaining Pmx for a given network graph from 
statistical data collected from experiments. With Pmx, we can proceed to calcu- 
late Pdrop , obtaining the final building block of the model needed for validation. 
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We can calculate Pinx from the probabilities -P(ll), -P(IO) and ^'(01) measured 
experimentally in the previous section: 

p _ P(ll) 
™^ P(10) + P(ll) 

The left graph of Fig. 11 shows the values for Pinx calculated for each experiment 
using a different Pioss- Based on these values, we compute Pdrop using (2), as 
seen in right graph of Fig. 11. As expected, the calculated values show that, 
in the face of message loss, different topologies yield different probabilities of 
dropping an item. Pdrop drops more harshly in the more clustered topologies, 
which suffer more from neighboring nodes having similar items as message loss 
increases. 
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Fig. 11. Probabilities Pinx (left) and Pdrop (right), for different topologies. 



6.6 Experimental Evaluation 

The experiments in this section simulate the case where a new item d is intro- 
duced by one node in the network in which all caches are full and uniformly 
populated by n = 500 items. Each experiment takes as input the topology of the 
network to determine which pairs of nodes can gossip. In all cases, we use a net- 
work of TV = 2500 nodes arranged in either of the three topologies mentioned in 
the previous section (a fully connected network, a square grid or a graph where 
every node has 4 randomly chosen neighbors). In the experiments that follow, 
after each gossip round, wc measure the total number of occurrences of d in the 
network (replication), and how many nodes in total have seen d (coverage). 

Simulations with Shuffle Each node in the network has a cache size of c = 100, 
and sends s = 50 items when gossiping. In each round, every node randomly 
selects one of its neighbors and shuffles. In order to make a fair comparison with 
the simulations with the model, wc let the nodes gossip for 1000 rounds with 
Pioss = 0, to ensure that none of the n = 500 initial items is lost in the startup 
period before initiating the measurements of the properties. After this startup 
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period of 1000 rounds, items are replicated and the replicas fill the caches of all 
nodes. At round 1000, Pioss is set to the desired value and item d is inserted into 
the network at a random location. From that moment on, we track its replication 
and coverage. 



Simulations with the model For the simulations with the model, n, c and s are 
system parameters set to 500, 100 and 50, respectively. Pinx is determined ex- 
perimentally on the network, and Pdrop is calculated using equation (2). Instead 
of maintaining a cache, each node in the network only maintains a variable that 
represents whether it holds item d or not (state 1 or 0, respectively). Nodes 
update their state in pairs according to the transition probabilities introduced 
before, see Fig. 2. This mimics an actual exchange of items between a pair of 
nodes according to Shuffle. While in the protocol this results in both nodes up- 
dating the contents of their caches, in a simulation using the analytical model 
updating the state of a node refers to updating only one variable: whether the 
node is in possession of item d or not. At round we set the state of a random 
node to 1 (while all the others have state 0) and track the state of the nodes for 
the remainder of the simulation. 

Figs 12 and 13 show the behavior of Shuffle (top row) and how it compares 
to the analytical model (bottom row) in terms of replication (left) and coverage 
(right) of d, for different values of Pi 
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Fig. 12. Grid, range 1: Top: Replication (left) and coverage (right) of item d for the 
shuffle. Bottom: difference between the shuffle and the model for replication (left) and 
coverage (right). 
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Fig. 13. Topology with outdegree 4: Top: Replication (left) and coverage (right) 
of item d for the shuffle. Bottom: difference between the shuffle and the model for 
replication (left) and coverage (right). 



runs (for both Shuffle and its model) were executed. Due to message loss, it is 
possible for item d to disappear after being introduced into the network (usually 
in the first few rounds). As Pioss increases, this situation is more likely. In the 
graphs, wc only take into account the successful runs, i.e., where the item did 
not disappear, but spread. 

The top rows of Figs 12 and 13 show the average and standard deviation of 
the successful runs with Shuffle. We compare this data with the avciiage of the 
successful runs of the model, and present the difference (in absolute value) in 
the bottom rows of Figs 12 and 13. We include the standard deviation for the 
shufHe in the bottom row for comparison. It clearly shows that the difference 
between the results obtained from the shufHe and the model fall well within the 
standard deviation of the shufHe results, confirming the ability of the model in 
predicting the average behavior of the protocol. 

In all successful runs (despite message loss), the network converges to a sit- 
uation in which there are roughly 500 copies of d, indicating that after repeated 
execution of the protocol d receives a fair share of the storage space in the net- 
work; 2500 • 100 cache slots divided between 500 items. Also, as expected, due to 
random gossiping item, d is eventually seen by all nodes in the network, when 
coverage reaches 100%. 
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7 Broadening the Scope: a Peer-Sampling Protocol 

In the same way that we have developed models for information dissemination 
algorithms, we can apply our methodology to other types of gossip protocols. In 
this section, wc describe, in broad terms, how to apply our modelling approach 
to Cyclon, a gossip-based peer-sampling protocol. 

7.1 The Cyclon Protocol 

Peer sampling is a service that nodes in largo-scale distributed systems can call 
to obtain a random peer to gossip with. Cyclon implements a peer sampling 
service by constructing and maintaining an unstructured overlay using gossiping 
membership information. As opposed to Shuffle, the data items exchanged by 
Cyclon are references to other nodes in the network. The references, or links, 
that a node stores in its cache represent the nodes that it can gossip with. The 
collection of links in the network constitiitcs an ever-changing overlay over which 
links are exchanged. The aim of Cyclon is to keep the caches of the nodes in the 
network populated with a random selection of links to other nodes. The following 
table summarizes the protocol: 



method 


operation 


RandomPeer() 


Select s random items and place them in cr. 

From a, select a gossip partner uniformly at random. 


PrepareMsgO 


If the node initiates the exchange, replace the link to the 
gossip partner with a link to itself in a. Return a. 


Update(CT, cTp) 


Discard links to the node itself and items that are already 

in the cache from Up. 

Include remaining items in cache by 1) using empty slots 
and 2) replacing entries among the ones in a. 



Note that the steps taken within the routines of Cyclon depend on the role 
that a node has in the gossip exchange. The node that initiates the exchange 
always includes a link to itself in the message, while the contacted node does 
not. This is reflected in the PrepareMsgO routine, which is slightly different for the 
initiator. 

7.2 Modelling Cyclon: States and Transitions 

According to Cyclon, every link d has a publisher D, and whenever D gossips, 
it sends the link d to its partner. Thus, when modelling the dissemination of 
link d throughout the network, we have to take into account whether D (the 
source of link d) is involved in the exchange. Therefore, we deflne the state of 
a node as a two-bit string, where the first bit indicates whether the node is D 
and the second bit indicates whether link d is in the node's cache (as done in 
the previous models). 

As nodes update their states in pairs, and with each node having a state 
represented by two bits, the state of a pair of nodes engaged in a Cyclon ex- 
change consists of a four-bit string. While this opens up to the possibility for 
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16 states, only 8 states are valid. Cyclon precludes a node from storing its own 
link, eliminating seven states (where one of the nodes has state 11). In addition, 
the state 1010, where D contacts itself, is also invalid. 

Fig. 14 shows the transitions between states where D is not involved. For 
simplicity, we assume that there are no link failures, resulting in a transition 
diagram similar to what we would encounter for Shuffle. Note that the four 
states depicted have the form OXOX, indicating that neither the initiator nor the 
gossip partner is D. 

When D is involved in the exchange, there are only four possible states, as 
shown in Fig. 15. Since D as the initiator always includes a link to itself in the 
set of items sent, the outcome will always be that the other node has a link d. 
On the other hand, when D is the contacted node (see bottom of figure), the 
initiator might or might not drop link d from its cache as a result. In all cases, 
D always keeps the state 10, as it can never keep a link to itself in its cache. 

Describing the transitions in terms of the building blocks Pseiect and Pdrop 
can be done in a similar way as with Shuffle (when D is not involved in the 
exchange). The variations in the routines, specifically PrepareMsg(), due to the 
role taken by the node in the exchange, have to be given special consideration, as 
they will cause Pseiect to be slightly different for the initiator and the contacted 
node. When node D is involved, Cyclon is very clear about the outcome. The 
transitions can be easily calculated. If node D is the initiator, there is only 
one possible outcome (probability = 1). If node D is the contacted node, the 
transitions depend solely on Pdrop- 



7.3 Other Protocols 

As mentioned before, in the original version of Newscast, data items are times- 
tamped according to their creation time. A node allocates the space in its cache 
for the received items by discarding corresponding number of oldest items. If an 
item is received that is also in the node's cache, the recent version of the item is 
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preferred. Like for Cyclon, a state of the transition diagram of Newscast is then 
a pair of tuples (a, b), where a,b G {0, (1, i)}. The first bit indicates if the data 
item is present in the gossiping node's cache. The integer i is a timestamp of 
the corresponding item. The expression for probabiUty Pseiect remains the same, 
but Pdrop is then a function of the corresponding timestamp. It can be either 
derived through rigorous analysis, or can be measured experimentally similar to 
the approach presented in Sec. 6. 

8 Conclusions 

Traditional analysis methods for computer systems such as model checking with 
its exhaustive state space search, fail to cope with large networks. Although quite 
useful for studying certain behavior of small-sized networks, these methods do 
not scale well for gossip protocols. On the other hand, standard methods for 
modelling of epidemics scale well, but abstract away the details of a protocol, 
showing only simple emergent behavior of the system. A challenge is to develop 
analytical models that capture (part of) the behavior of a system, and then 
subsequently optimize design parameters, at the right level of abstraction. 

In this paper, wc presented an analytical framework for a class of gossip-based 
information dissemination protocols. Modelling a gossip protocol at the level of 
local pairwise interactions, as demonstrated in this paper, is a scalable approach 
to analyze such protocols. On the one hand, this analytical model includes the 
mechanics of communication between nodes on the level of the protocol details. 
On the other hand, such a model allows for studying the impact of system 
parameters on the performance of the protocol, and can be used to optimally 
design and fine-tune it. 

Furthermore, we have presented the analysis of a gossip protocol in the pres- 
ence of transient communication failures. For our framework, wc introduced a 
hybrid method to compute Pdrop that combines both rigorous modelling of the 
protocol and statistical data sampling from large-scale Monte Carlo simulations 
for different network topologies. To be more precise, wc derive analytical expres- 
sions for components of our framework that are invariant with respect to the 
scenarios we want to model. Having decomposed our model into its basic com- 
ponents, we identify the ones that arc affected by specific scenario configurations 
(in this case, message loss and topology combinations). Finding analytical ex- 
pressions for these components (in this work, only Pinx 

) would require modelling 

each specific scenario configuration, reducing the applicability of the framework 
to those very specific configurations. Instead, we isolate the framework from the 
scenario-specific component and use statistical data sampling to obtain a value 
for it. 

We opt for this combined approach since it allows us to build a framework 
that captures the behavior of the gossip protocol without requiring us to incor- 
porate scenario-specific elements into the model. The challenge in this approach 
lies in being able to decompose the model into its minimal components, in such 
a way that the ones which are dependent on particular scenarios (for which ex- 
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pressions that encompass all scenarios cannot be derived) can be isolated and 
computed separately. In effect, we strive for a golden mean between high-level 
models such as for epidemics showing only the emergent behavior and the low- 
level models of the protocol that depend on particular implementation settings. 

With respect to gossip-based dissemination, our study revealed that for net- 
works with lossy communication channels, the assumption of a uniform distri- 
bution of data is valid only if every node can gossip with any other node in the 
network. In future, we plan to study the impact of different gossiping frequency 
on the performance results. 

References 

1. A. AUavona, A. Dcmcrs, and J. Hopcroft. Correctness of a gossip based membership 
protocol. In Proc. of PODC, pages 292-301. ACM, 2005. 

2. R. Bakhshi. Gossiping Models: Formal Analysts of Epidemic Protocols. PhD thesis, 
Vrije Universiteit Amsterdam, 2011. http://hdl.handle.iiet/1871/18387. 

3. R. Bakhshi, L. Cloth, W. Fokkink, and B. R. Haverkort. Mean-field framework 
for performance evaluation of push-pull gossip protocols. Performance Evaluation, 
68(2):157-179, 2011. 

4. R. Bakhshi, J. Endrullis, S. EndruUis, W. Fokkink, and B. Haverkort. Automating 
the mean-field method for large dynamic gossip networks. In Proc. of QEST, pages 
241-250. IEEE Computer Society, 2010. 

5. R. Bakhshi and A. Fchnkcr. On the impact of modelling choices for distributed 
information spread. In Proc. of QEST, pages 41-50. IEEE Computer Society, 2009. 

6. R. Bakhshi, D. Gavidia, W. Fokkink, and M. van Steen. An analytical model of 
information dissemination for a gossip-based protocol. Camp. Netw., 53(13):2288- 
2303, 2009. 

7. S. Boyd, A. Ghosh, B. Prabhakar, and D. Shah. Randomized gossip algorithms. 
IEEE/ACM Trans, on Networking (TON), 14(SI):2508-2530, 2006. 

8. Y. Busnel, R. Beraldi, and R. Baldoni. A formal characterization of uniform peer 
sampling based on view shuffling. In Proc. of PDCAT, pages 360-365, 2009. Ex- 
tended version to appear in JPDC. 

9. P. Costa, V. Gramoli, M. Jelasity, G. P. Jesi, E. Le Merrer, A. Montresor, and 
L. Querzoni. Exploring the interdisciplinary connections of gossip-based systems. 
ACM SIGOPS Oper Syst. Rev., 41(5):51-60, 2007. 

10. P. Crouzcn, J. van de Pol, and A. Rensink. Applying formal methods to gos- 
siping networks with mCRL and GROOVE. ACM SIGMETRICS Performance 
Evaluation Review, 36(3):7-16, 2008. 

11. A. Demers, D. Greene, C. Hauser, W. Irish, J. Larson, S. Shenker, H. Sturgis, 
D. Swinehart, and D. Terry. Epidemic algorithms for replicated database mainte- 
nance. In Proc. of PODC, pages 1-12. ACM Press, 1987. 

12. A. Dimakis, A. Sarwate, and M. Wainwright. Geographic gossip: efficient aggre- 
gation for sensor networks. In Proc. of IPSN, pages 69-76. ACM Press, 2006. 

13. N. Drost, E. Ogston, R. van Nieuwpoort, and H. Bal. ARRG: Real- World Gossip- 
ing. In Proc. of HPDC, pages 147-158. ACM, 2007. 

14. A. Fehnker and P. Gao. Formal verification and simulation for performance analysis 
for probabilistic broadcast protocols. In Proc. of ADHOC-NOW, volume 4104 of 
LNCS, pages 128-141. Springer, 2006. 



23 



15. D. Gavidia, S. Voulgaris, and M. van Steen. A gossip-based distributed news service 
for wireless mesh networks. In Proc. of WONS, pages 59-67. IEEE, 2006. 

16. M. Gurevich and I. Keidar. Correctness of gossip-based membership under message 
loss. In Proc. of PODC, pages 151-160. ACM, 2009. 

17. M. Jelasity, W. Kowalczyk, and M. van Steen. Newscast computing. Technical 
Report IR-CS-006, Vrije Universiteit, 2003. 

18. M. Jelasity, W. Kowalczyk, and M. van Steen. Newscast computing. Technical 
Report IR-CS-006, Vrije Universiteit Amsterdam, 2003. 

19. M. Jelasity, A. Montresor, G. P. Jesi, and S. Voulgaris. PeerSim: A peer-to-peer 
simulator, http : / /peer sim . sour cef orge . net/. 

20. M. Jelasity, S. Voulgaris, R. Guerraoui, A.-M. Kermarrec, and M. van Steen. 
Gossip-based peer sampling. ACM Trans, on Comput. Syst., 25(3) :8, 2007. 

21. M.-J. Lin and K. MarzuUo. Directional gossip: Gossip in a wide area network. In 
Proc. of EDCC-3, pages 364 379. Springer, 1999. 

22. A. Stavrou, D. Rubenstein, and S. Sahu. A lightweight, robust p2p system to 
handle flash crowds. In Proc. of ICNP, pages 226-235. IEEE, 2002. 

23. A. Tanenbaum and M. van Steen. Distributed Systems: Principles and Paradigms. 
Prentice Hall, 2007. 

24. S. Voulgaris, D. Gavidia, and M. van Steen. Cyclon: Inexpensive membership man- 
agement for unstructured p2p overlays. J. Network and Syst. Manage., 13(2):197- 
217, 2005. 

A Pull-only and push-only veiriations 

Our framework is not limited to p\ish-pull gossip protocols, but allows for mod- 
elling of push-only as well pull-only gossip protocols. To illustrate this, we use 
two variations of the simplified Newscast, and briefly demonstrate the calculation 
of transition probabilities for push and pull versions of the protocol. 

Push-only protocol 

Recall that in push-based gossip protocols, only the initiator node A sends its 
message to B, but docs not update its own cache. Therefore, the transitions 
where A changes its state are not possible anymore, and the probability of such 
transitions is 0. The transition probability P(00|00) = 1 since items do not 
appear from nowhere. 

State (0, 1) Before gossip, only B has d in its cache. 

a2&2 = 01: node B does not drop d from its cache; F(01|01) = P^drop- 

02^2 = 00: node B drops d during the update of its cache; P(00|01) = Pdrop- 

State (1, 0) Before gossip, d is in the cache of A. 

02^2 ~ 10: node A either docs not send rf, or sends it, but B drops d, i.e., the 

probability is P(10|10) = P^select + Pselect ■ Pdrop- 

0262 = 11: node A sends the item d, and node B keeps it: P(ll|10) = Pselect ■ 

P—> drop- 

State (1, 1) Before gossip, d is in the caches of both nodes. 

02^2 = 10: node B drops d from its cache during the update; P(10|ll) = Pdrop- 

0262 = 11: node B keeps the item d after the update; i.e., P(ll|ll) = P^drop- 
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Pull-only protocol 



In contrast to the push version, in pull-based gossip protocols the initiator node 
only requests data from its peer. In terms of state transitions, only initiator 
A can update its state, but not the contacted node B. Thus, the probability 
of transitions in which the state of B before gossip differs from its state after 
the gossip is 0. The state (0,0) again has a self-transition with probability 1: 
P(00|00) = 1. 

State (0, 1) Before gossip, only B has d in its cache. 

a2&2 = 01: node A does not get d from B, or B sends the item, but A drops it; 

P(01|01) = P^select + Pselect ' Pdrop- 

02^2 = 11: node B sends d and node A keeps it; P(ll|01) = Pselect ■ P^drop- 
State (1, 0) Before gossip, d is in the cache of A. 

a2&2 = 10: node A does not drop d after its cache update, i.e., the probability 

is P(10|10) = P^drop- 

02^2 = 00: node A does not keep the item for its updated cache: P(00|10) = 

Pdrop • 

State (1, 1) Before gossip, d is in the caches of both nodes. 

02^2 = 01: node A drops d from its cache during the update; P(01|ll) = Pdrop- 

02^2 = 11: node A keeps the item d after the update with the probability P(ll|ll) = 

P—^drop' 
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